ILLUMINATION DEVICE FOR PROJECTION SYSTEM
AND METHOD FOR FABRICATING 

Cross-reference to Related Applications 

The present application is a division of U.S. Patent Application
No. 09/422,398, filed October 21, 1999, which claims the benefit of the priority dates
of U.S. Provisional Application Nos. 60/105,281, filed October 22,
1998, and 60/119,780, filed February 11, 1999, each of which is hereby
incorporated in its entirety into the present application by reference.

Reference is also made to the co-pending application entitled "Illumination
Device for Projection System and Method for Fabricating" (Atty Docket
No. 81468/284420), being filed concurrently herewith and hereby incorporated in its
entirety into the present application by reference.

Background of the Invention 

Optical lithography has been one of the principal driving forces behind the
continual improvements in the size and performance of the integrated circuit (IC)
since its inception. Feature resolution down to 0.30 μm is now routine using the 365
nm mercury (Hg) i-line wavelength and optical projection tools operating at
numerical apertures above 0.55 with aberration levels below 0.05λ RMS OPD. The
industry is at a point where resolution is limited for current optical lithographic
technologies. In order to extend capabilities toward sub-0.25 μm, modifications in
source wavelength, optics, illumination, masking, and process technology are required
and are receiving considerable attention.

However, as devices get smaller, the photomask pattern becomes finer. Fine
patterns diffract light and thus detract from imaging the photomask onto the surface of
a wafer. FIG. 1a shows what happens when a photomask with a fine pattern 6 having
a high frequency (pitch 2d is about several microns) is illuminated through a
projection lens system 7. The fine pattern 6 is illuminated along a direction
perpendicular to its surface, and it diffracts the light that passes through the
mask. Diffraction rays 3-5 caused by the pattern include a zero-th order diffraction
ray 5 directed in the same direction as the direction of advancement of the input ray,
and higher order diffraction rays such as positive and negative first order diffraction
rays 3, 4, for example, directed in directions different from the input ray. Among
these diffraction rays, those of particular diffraction orders such as, for example, the
zero-th order diffraction ray and positive and negative first order diffraction rays 3, 4,
5, are incident on a pupil 1 of the projection lens system 7. Then, after passing through
the pupil 1, these rays are directed to an image plane of the projection lens system,
whereby an image of the fine pattern 6 is formed on the image plane. In this type of
image formation, the ray components that contribute to the contrast of the
image are the higher order diffraction rays. If the frequency of a fine pattern increases,
the problem arises that the optical system does not receive the higher order diffraction
rays. Therefore, the contrast of the image degrades and, ultimately, the imaging itself
becomes unattainable.

As will be shown below, some solutions to this problem rely upon shaping the rays of
light impinging on the photomask in order to provide off-axis illumination to compensate
for the contrast lost due to diffraction. These techniques rely upon optical systems for
shaping the rays that illuminate the photomask.

In considering potential strategies for sub-0.25 μm lithography, the
identification of purely optical issues is difficult. Historically, the Rayleigh criteria
for resolution (R) and depth of focus (DOF) have been utilized to evaluate the
performance of a given technology:

$R = k_1 \lambda / \mathrm{NA}$
$\mathrm{DOF} = \pm k_2 \lambda / \mathrm{NA}^2$

where k1 and k2 are process-dependent factors, λ is wavelength, and NA is numerical
aperture. As wavelength is decreased and numerical aperture is increased, resolution
capability improves. Considered along with the wavelength-linear and NA-quadratic
loss in focal depth, reasonable estimates can be made for system performance.
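
By way of illustration only, the two Rayleigh relations can be evaluated numerically as in the following sketch. The function names and the values chosen for k1 and k2 are assumptions made for this example and are not taken from the specification; only the wavelengths and numerical apertures echo figures mentioned in the text.

```python
def rayleigh_resolution(k1: float, wavelength_nm: float, na: float) -> float:
    """Rayleigh resolution R = k1 * lambda / NA, returned in nm."""
    return k1 * wavelength_nm / na


def rayleigh_dof(k2: float, wavelength_nm: float, na: float) -> float:
    """Half-range of the depth of focus, DOF = +/- k2 * lambda / NA^2, in nm."""
    return k2 * wavelength_nm / na ** 2


if __name__ == "__main__":
    # k1 = 0.6 and k2 = 1.0 are illustrative assumptions, not values from the text.
    K1, K2 = 0.6, 1.0
    for wavelength_nm, na in [(365.0, 0.55), (248.0, 0.63), (193.0, 0.60)]:
        r = rayleigh_resolution(K1, wavelength_nm, na)
        dof = rayleigh_dof(K2, wavelength_nm, na)
        print(f"lambda={wavelength_nm:.0f} nm, NA={na:.2f}: "
              f"R={r:.0f} nm, DOF=+/-{dof:.0f} nm")
```

The sketch makes the trade-off visible: shortening the wavelength improves R and reduces DOF in the same proportion, whereas raising NA improves R linearly but reduces DOF quadratically.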

Innovations in lithography systems, materials and processes that are capable of
producing improvements in resolution, focal depth, field size, and process
performance are those that are considered most practical.

The Hg lamp is a source well suited for photolithography and is relied on
almost entirely for production of radiation in the 350-450 nm range. Excimer lasers
using argon fluoride (ArF) and krypton fluoride (KrF), which produce radiation at 193
nm and 248 nm, respectively, are also used. DUV lithography at 248 nm is now
being implemented in manufacturing operations and may be capable of resolution to
0.18 μm.

The control of the relative size of the illumination system numerical aperture
has historically been used to optimize the performance of a lithographic projection
tool. Control of this NA with respect to the projection system's objective lens NA
allows for modification of spatial coherence at the mask plane, commonly referred to as
partial coherence. This is accomplished through specification of the condenser lens
pupil size with respect to the projection lens pupil in a Köhler illumination system.
Essentially, this allows for manipulation of the optical processing of diffraction
information. Optimization of the partial coherence of a projection imaging system is
conventionally accomplished using full circular illuminator apertures. By controlling
the distribution of diffraction information in the objective lens with the illuminator
pupil size, maximum image modulation can be obtained. Illumination systems can be
further refined by considering variations to full circular illumination apertures. A
system where illumination is obliquely incident on the mask at an angle so that the
zero-th and first diffraction orders are distributed on alternate sides of the optical
axis may allow for improvements. Such an approach is generally referred to as off-axis
illumination. The resulting two diffraction orders can be sufficient for imaging.
The minimum pitch resolution possible for this oblique condition of partially coherent
illumination is 0.5λ/NA, one half that possible for conventional illumination. This is
accomplished by limiting illumination to two narrow beams, distributed at selected
angles. The illumination angle is chosen uniquely for a given wavelength (λ),
numerical aperture (NA), and feature pitch (d) and can be calculated for dense
features as sin⁻¹(0.5λ/d) for NA = 0.5λ/d. The most significant impact of off-axis
illumination is realized when considering focal depth. In this case, the zero-th and first
diffraction orders travel an identical path length regardless of the defocus amount.
The consequence is a depth of focus that is effectively infinite.
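
As a hedged numerical illustration of the relations just stated, the following sketch evaluates the ideal two-beam illumination angle sin⁻¹(0.5λ/d) and the limiting pitch 0.5λ/NA. The function names are hypothetical, and the sketch assumes the idealized case described above, in which the zero-th and first orders sit symmetrically about the optical axis.

```python
import math


def off_axis_angle_deg(wavelength_nm: float, pitch_nm: float) -> float:
    """Ideal oblique illumination angle, asin(0.5 * lambda / d), in degrees."""
    s = 0.5 * wavelength_nm / pitch_nm
    if s > 1.0:
        raise ValueError("pitch is below half the wavelength; no real angle exists")
    return math.degrees(math.asin(s))


def min_pitch_nm(wavelength_nm: float, na: float) -> float:
    """Smallest pitch resolvable with ideal off-axis illumination, 0.5 * lambda / NA."""
    return 0.5 * wavelength_nm / na


if __name__ == "__main__":
    # 193 nm and NA = 0.60 echo the design example given later in the text;
    # the list of pitches is chosen for illustration only.
    wavelength_nm, na = 193.0, 0.60
    print(f"minimum pitch: {min_pitch_nm(wavelength_nm, na):.1f} nm")
    for pitch in (260.0, 325.0, 390.0, 455.0):
        angle = off_axis_angle_deg(wavelength_nm, pitch)
        print(f"pitch {pitch:.0f} nm -> illumination angle {angle:.1f} deg")
```

Because the angle depends on the pitch, a single pair of narrow beams is ideal for only one pitch, which is the limitation the remainder of this background section addresses.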

In practice, limiting illumination to allow for one narrow beam or pair of
beams leads to zero intensity. Also, imaging is limited to features oriented along one
direction in an x-y plane. To overcome this, an annular or ring aperture has been
employed that delivers illumination at the angles needed, with a finite ring width to allow
for some finite intensity. The resulting focal depth is less than that for the ideal case,
but improvement over a full circular aperture can be achieved. For most integrated
circuit applications, features are limited to horizontal and vertical orientations, and a
four-zone configuration may be more suitable. Here, zones are at diagonal positions
oriented 45 degrees to horizontal and vertical mask features. Each beam is off-axis to
all mask features, and minimal image degradation exists. Either the annular or the
four-zone off-axis system can be optimized for a specific feature size, which would
provide non-optimal illumination for all others. For features other than those that are
targeted and optimized for, higher frequency components do not overlap, and
additional spatial frequency artifacts are introduced. This can lead to a possible
degradation of imaging performance.

When considering dense features (1:1 to 1:3 line to space duty ratio),
modulation and focal depth improvement can be realized through proper choice of
illumination configuration and angle. For true isolated features, however, discrete
diffraction orders would not exist; instead a continuous diffraction pattern is
produced. Convolving such a frequency representation with either illumination zones
or annular rings would result in diffraction information distributed over a range of
angles. Truly isolated line performance is, therefore, not improved with off-axis
illumination. When features are not completely isolated but have low density (>1:3
line to space duty ratio), the condition for optimum illumination will not be optimal
for more dense features. Furthermore, the use of off-axis illumination is generally not
required for the large pitch values that correspond to low density geometry. As dense
and mostly isolated features are considered together in a field, it follows that the
impact of off-axis illumination on these features will differ, and a large disparity in
dense to isolated feature performance can result.

One approach to generating off-axis illumination is to incorporate a metal
aperture plate filter into the fly eye lens assembly of the projection system illuminator,
providing oblique illumination. A pattern on such a metal plate would have four
quadrupole openings (zones) with sizing and spacing set to allow diffraction order
overlap for specific geometry sizing and duty ratio on the photomask, as disclosed in
JP Patent Laid-Open (KOKAI) Publication No. 4-267515. Such an approach results in
a significant loss in intensity available to the mask, lowering throughput and making
the approach less than desirable. Additionally, the four circular openings need to be
designed specifically for certain mask geometry and pitch and would not improve the
performance of other geometry sizes and spacings. Large levels of mask biasing or
mask optical proximity correction (OPC), where mask features are pre-distorted to
produce desired image characteristics, would be required to allow for use of this
approach with a variety of features. Filtering, by limiting its effective area, reduces
the effect of the fly eye diffuser on maximizing illumination uniformity. Illumination
uniformity may be degraded. This approach also limits the illumination profile to one
having holes in a metal plate. That is, the masking metal must remain contiguous.
The previous work in this area describes such methods using either two or four
openings in the aperture plate: EP0500393, US5305054, US5673103, US5638211,
EP0496891, EP0486316, US 37,9252.

Another approach to off-axis illumination using the four-zone configuration,
which is disclosed in U.S. Patent No. 5,627,625, is to divide the illumination field of
the projection system into beams that can be shaped to distribute off-axis illumination
to the photomask. By incorporating the ability to shape off-axis illumination,
throughput and flexibility of the exposure source are maintained. Additionally, this
approach allows for illumination that combines off-axis and on-axis (conventional)
characteristics. By doing so, the improvement to dense features that are targeted with
off-axis illumination is less significant than with straight off-axis illumination. The
performance of less dense features, however, is closer to optimal because of the more
preferred on-axis illumination for these features. The result is a reduction in the
optical proximity effect between dense and isolated features. Optimization is less
dependent on feature geometry, and more universal illumination conditions can be
selected.

A problem with this divided illumination approach is that it requires
reconfiguration of the illumination system of a projection tool, a task that is not
practical on existing tools or systems designed with other illumination systems.
Additionally, the use of divided beam illumination limits the fine control of beam
shape, size, and position to that which is possible with the optical components utilized in
the system. Variations in shape, size, position, number of beams, maximum aperture
size, or other feature or lens specific variations to the illumination intensity profile
become difficult without significant mechanical modifications. Some variations may
not be practical or possible with this approach. This has significantly limited the
acceptance or use of this approach in most integrated circuit fabrication operations.

Summary of the Invention 

A shaped illumination approach is described that allows for off-axis
illumination of a photomask in a projection exposure tool. It is necessary to control
both the off-axis and the on-axis character of the illumination so that, for instance,
dense line performance can be improved and more isolated line performance can be
maintained, i.e., optical proximity effect (hereinafter "OPE") is kept to a minimum.
Minimal OPE correction is desired to reduce mask complexity. There is also a desire
for a flexible technique that can be incorporated into most existing or future
generation projection exposure tools with a minimum amount of illumination system
retrofitting. It is important that such an improvement be easily removed to allow a
return to original operating conditions, since it is expected that a given projection
exposure system would be used for a variety of applications. Such an approach can
lead to optimized illumination conditions, which can be incorporated into an
exposure system as a more permanent condition. The invention also provides an
improvement that allows for fine adjustment or modification to accommodate
mask-specific or lens-specific characteristics, which is important as resolution and
focal depth requirements are pushed beyond the capability of conventional optical
lithography. Maintaining illumination throughput is also critical, as any loss in
intensity will result in increased exposure time requirements.

The invention provides such a solution. It modifies an existing illumination
system by adding a masking aperture in the illumination pupil plane, fabricated as an
optical component reticle, patterned and dithered into a large number of elements to
allow for control of the projected light distribution at the mask plane, and inserted at
the condenser lens pupil plane. This masking aperture comprises a translucent
substrate and a masking film. The distribution of the intensity through the masking
aperture in the illumination pupil plane is determined to provide optimized
illumination. The illumination region or regions exhibit varying intensity, which is
accomplished by creating a half-tone pattern via pixelation of the masking film,
thereby allowing for maximum variation in illumination beyond the simple binary
(clear or opaque) possible with earlier pupil plane filtering approaches.

More specifically, the invention includes an aperture mask for an illumination
system to provide controlled on-axis and off-axis illumination. The aperture mask
acts as a diffraction element. The aperture mask is divided into an array of elements,
and each element is constructed with a matrix of pixels. In the preferred embodiment
the array of pixels is 8 x 8. The number of elements in the illumination array is
generally larger than 3 x 3; array sizes of 21 x 21 and 51 x 51 are examples, which
correspond to 441 and 2601 elements, respectively. The elements are patterned in
accordance with a selected wavelength of incident light to diffract the incident light
into an illumination pattern for illuminating a photomask.

The intensity is modulated by the intensity state of pixels within each element.
The highest intensity element has all pixels clear, or at maximum intensity. Light of
suitable wavelength passes through without attenuation. An element with all 64 pixels
dark, or at minimum intensity, attenuates or blocks all light. Elements of intensity
between none (0%) and all (100%) are created by the state of the pixels in a given
element. Random patterns and other patterns between elements may produce artifacts
similar to moire patterns. Such artifacts are undesired. I discovered that a dithered
pattern using position dependent thresholds produced illumination patterns that had
little or no artifacts.

I also discovered that traditional optical shaping systems, such as beam-splitters
or diffractive optical elements, and my diffraction shaping system can each be
improved by adding an illumination aperture. A square illumination aperture shows
maximum improvement. It is located at or near the pupil of the illumination system.
An aperture with a large, central square opening and an opaque border is inserted at
the condenser lens pupil proximate to the fly's eye lens. It can also be designed into
the masking aperture. The resulting illumination pattern fills the corners and squares
the edges of the illumination pupil and limits the non-optimal frequency spreading
character along the x and y axes while optimizing the off-axis illumination angles.
The square illumination aperture is especially helpful for imaging features that are
oriented along x and y directions in the mask plane. The use of a central obscuration
(square and also round shaped) applied in the same location can similarly be achieved
and can lead to performance improvements described hereinafter. Furthermore, any
combination of off-axis illumination with a square pupil or obscuration has potential
to improve performance for geometry oriented in the x-y direction. This can include,
but is not limited to, round zones, elliptical zones, square zones, and annular slots
(that is, an annular ring masked off on the x and y axes to form arc-shaped zones).

Brief Description of the Drawings 

Figure 1a is a schematic view of an illumination system for a photomask.

Figure 1b shows the zero-th and first positive and first negative diffraction rays.

Figure 1c is a schematic view of a four-zone illumination arrangement.

Figure 2 is a drawing showing diffraction orders (first and zero) for 130 nm x- 
oriented features using four-zone illumination, plotted in the frequency plane of the 
objective lens. Center sigma is 0.68 and radius sigma is 0.13, chosen to accommodate 
feature pitch values from 260 to 455 nm. 

Figure 3 is a drawing of the continuous tone intensity distribution for a masking
aperture based on four distributed-intensity zones.

Figure 4 is a plot of the x-y distribution of dithered bilevel masking cells for an
illumination aperture consisting of four circular normal distributed-intensity zones
placed at diagonal positions corresponding to off-axis illumination for geometry
oriented in horizontal and vertical directions. The axes are divided into relative
distances using the center and the edges of the mask.

Figure 5 is a three dimensional plot of the x-y distribution shown in Figure 4. 

Figure 6 is a drawing of surface contours of the continuous tone intensity distribution
for a masking aperture based on two distributed-intensity zones where the maximum
off-axis angle is limited to one direction.

Figure 7 is a plot of the x-y distribution of dithered, bilevel masking cells for an
illumination aperture consisting of four elliptical normal distributed-intensity zones
placed at diagonal positions corresponding to off-axis illumination for geometry
oriented in horizontal and vertical directions. The axes are divided into relative
distances using the center and the edges of the mask.

Figure 8 is a contour plot of the x-y distribution shown in Figure 7. 

Figure 9 is a three dimensional plot of the x-y distribution for an illumination
aperture consisting of four stepped square distributed-intensity zones placed at
diagonal positions corresponding to off-axis illumination for geometry oriented in
horizontal and vertical directions.

Figure 10 is a contour plot of the x-y distribution for an illumination aperture
consisting of four stepped square distributed-intensity zones, as described for
Figure 9.

Figure 11 is a plot of the x-y distribution of dithered, bilevel masking cells for an
illumination aperture consisting of four elliptical normal distributed-intensity zones,
as described for Figure 9.

Figure 12 is a schematic representation of the bilevel representation of 65 gray levels 
using the ordered dithering algorithm. 

Figure 13 is a graph showing normalized aerial image log-slope (NILS) vs. focus for
130 nm features using four-zone illumination, σc=0.68 and σr=0.20.

Figure 14 is a graph showing normalized aerial image log-slope (NILS) vs. focus for
130 nm features using distributed-intensity four-zone illumination (σc=0.68 and
σr=0.30) and a circular hard stop (σ=0.8).

Figure 15a is a graph showing normalized image log slope (NILS) vs. focus for 130
nm features using the illumination shown in Figure 19.

Figure 15b is a graph showing normalized image log slope (NILS) vs. focus for 130
nm features using the illumination shown in Figure 23.

Figure 15c is a graph showing normalized image log slope (NILS) vs. focus for
annular ring illumination of 150 nm features using 248 nm wavelength and 0.63 NA,
with duty ratios of 1:1.5 to 1:5.

Figure 15d is a graph showing normalized image log slope (NILS) vs. focus for
square ring illumination of 150 nm features using 248 nm wavelength and 0.63 NA,
with duty ratios of 1:1.5 to 1:5.

Figure 16 is a plot of the normalized image log slope, showing the influence of a circular
four-zone (σc=0.7 and σr=0.2 on a zero intensity field) with 0.1 waves of primary
aberrations (astigmatism, tilt, spherical, and coma) through focus for 130 nm
features imaged with a 193 nm imaging system using a numerical aperture of 0.6.

Figure 17 is a plot of the normalized image log slope, using the bi-level
representation of the distributed-intensity illumination described for this invention.

Figure 18 is a contour plot of circular gaussian zones used to optimize energy
distribution in the pupil, where energy is concentrated in the center of the pupil and at
off-axis angles.

Figure 19 is a three dimensional plot of circular gaussian zones and a square limiting
zone used to limit the on-axis illumination component while retaining off-axis zone
energy.

Figure 20 is a perspective view of a beamsplitter illumination system with a square
aperture mask inserted in its pupil.

Figure 21 shows how a square central obscuration can be included in an aperture to
reduce the on-axis character of illumination and improve the performance of dense
features.

Figure 22 shows the square ring, off-axis source which delivers optimal off-axis 
illumination for dense features while providing on-axis illumination for more isolated 
features. 



Figure 23 shows how the square ring approach can be combined with the gaussian
four-zone approach to emphasize corner zone effects.

Figure 24 is a contour plot of a combined annular and gaussian four-zone aperture to 
improve the off-axis character of annular illumination. 

Figure 25 is an illumination system using diffractive optical elements and a square
aperture.

Figure 26 is a 51 x 51 element array of a dithered halftone pattern.

Detailed Description of the Invention 

For given ranges of feature types and/or sizes (types being lines, spaces,
contacts, and dense or isolated combinations of these), exposure conditions are
optimized by determining the average off-axis angle to accommodate all features.
The distribution of off-axis angles is then determined based on the range of feature
sizes of interest. As the range of feature sizes increases, the condition of off-axis
illumination approaches a limit equivalent to the on-axis condition. Most commonly,
duty ratios for a given feature size may range from 1:1 to 1:6 line:space ratio. The
spread of off-axis illumination angles is accomplished by shaping zones (for the two
or more zone case, including four-zone) or rings (for the annular case) to produce
continuous intensity distributions.

As an example of the design considerations, 130 nm features are considered
using a 193 nm exposure wavelength and a 0.60 objective lens numerical aperture.
Duty ratios from 1:1 to 1:6 are included. Features are considered dense with a duty
ratio less than 1:3. Table 1 shows the pitch values (p) for these dense features, along
with the required axis center sigma and four-zone center sigma (σc) values for
optimum off-axis illumination. Center sigma values on axis are determined as
λ/(2p NA), as shown in Figure 1. Since diffraction order placement is determined by
the projection of the four-zone onto the x or y axis, the center four-zone sigma values
are larger by a factor of √2.

In order to design an off-axis illumination configuration that can
accommodate and enhance the range of pitch values in Table 1, zone position and
radius values must be chosen so that some amount of order overlap occurs for each
case. This can be accomplished if the four-zone center σc and the radius σr values are
set as follows:

$\sigma_c = \sqrt{2}\,(0.61 + 0.35)/2 = 0.68$

$\sigma_r \geq (0.61 - 0.35)/2 = 0.13$

These choices for zone center and radius values correspond to the situation
where zero and first diffraction orders begin to overlap for the extreme pitch values of
260 and 455 nm, as shown in Figure 2. The √2 term does not factor into
determination of σr values since the zone radius is projected directly onto the axes.
The extent of order overlap for these extreme pitch values is, however, mostly
ineffectual, and larger values are required to influence performance. A σr value of
0.20 allows for significant order overlap (~20%) and is a more practical starting value
for further optimization.
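
The arithmetic behind this design example can be sketched as follows, for illustration only. The helper names are hypothetical; the wavelength, numerical aperture, line width and duty ratios are those stated in the text. Computed directly from λ/(2p NA), the sigma values agree with the tabulated ones to within roughly 0.01, the small differences presumably arising from rounding in the table.

```python
import math

WAVELENGTH_NM = 193.0   # exposure wavelength from the design example
NA = 0.60               # objective lens numerical aperture from the design example
LINE_NM = 130.0         # feature (line) width from the design example


def pitch_nm(line_nm: float, space_ratio: float) -> float:
    """Pitch for a 1:space_ratio line:space duty ratio."""
    return line_nm * (1.0 + space_ratio)


def axis_sigma(pitch: float, wavelength_nm: float = WAVELENGTH_NM,
               na: float = NA) -> float:
    """On-axis center sigma, lambda / (2 * p * NA)."""
    return wavelength_nm / (2.0 * pitch * na)


def four_zone_sigma(pitch: float, wavelength_nm: float = WAVELENGTH_NM,
                    na: float = NA) -> float:
    """Diagonal (four-zone) center sigma, larger by sqrt(2)."""
    return math.sqrt(2.0) * axis_sigma(pitch, wavelength_nm, na)


def zone_design(sigma_max: float, sigma_min: float):
    """Zone center and minimum zone radius spanning the two extreme axis sigmas."""
    center = math.sqrt(2.0) * (sigma_max + sigma_min) / 2.0
    min_radius = (sigma_max - sigma_min) / 2.0
    return center, min_radius


if __name__ == "__main__":
    for ratio in (1.0, 1.5, 2.0, 2.5):
        p = pitch_nm(LINE_NM, ratio)
        print(f"1:{ratio:g}  pitch={p:.0f} nm  axis sigma={axis_sigma(p):.3f}  "
              f"four-zone sigma={four_zone_sigma(p):.3f}")
    center, radius = zone_design(0.61, 0.35)  # extreme axis sigmas as tabulated
    print(f"zone center sigma={center:.2f}, minimum zone radius sigma={radius:.2f}")
```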
When imaging both dense and isolated features, illumination that resembles
both the strong four-zone and the conventional on-axis illumination is desirable. This
can lead, for instance, to the four-zone illumination design where circular zones are
replaced with continuous tone zones, as shown in Figure 3.

Table 1. Pitch and sigma values for 130 nm features and 1:1 to 1:2.5 duty ratios.

Duty     Pitch (nm)   Axis sigma   Four-zone sigma
1:1      260          0.61         0.87
1:1.5    325          0.50         0.71
1:2      390          0.42         0.59
1:2.5    455          0.35         0.49



The masking aperture of this invention is a bilevel representation of the
desired intensity distribution in the illuminator. It is desired to resemble a near
continuously varying transition from open to opaque areas. To achieve this result, the
illumination pattern is divided into a large number of elements and each element is a
matrix of pixels. Dithering or pixelation of the continuous distribution of intensity is
used for translation to the binary or bilevel masking aperture. The element array is
large, consisting of, for instance, 5 x 5, 7 x 7, 9 x 9, 11 x 11, 21 x 21 or 51 x 51
elements, but not limited to these cases. The illumination profile is divided into such
an element array. Individual masking pixels are small, on the order of 10 to 100 μm,
and are either translucent or opaque. Their size is dependent on the size of the
physical masking aperture. The continuous tone nature of the illumination intensity
profile is translated by controlling the spatial density of the bilevel display states on
the masking aperture. Several decision rules may be implemented to produce the
output distribution on the masking aperture. A fixed threshold technique is simplest
in form, but an ordered dithering approach may be used to most effectively translate a
continuous tone intensity profile into a bilevel masking aperture representation.
Intensity values are compared to a position-dependent set of threshold values
contained in an n x n dither matrix. A set of selection rules repeats the dither matrix in
a checkerboard arrangement over the illumination field. One key to this approach is
the generation of a bilevel representation of the continuous tone image with the
minimal amount of low spatial frequency noise. In other words, the occurrence of
texture, granularity, or other artifacts is reduced to a minimum, allowing for the
critical control of illumination uniformity demanded in projection exposure tools.
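
A minimal sketch of ordered dithering of this kind is given below, for illustration only. The specification does not state which threshold matrix is used; the recursively built (Bayer-type) matrix here is an assumption chosen because it is a common n x n ordered-dither matrix, and the function names are hypothetical. Each pixel is made clear (1) when its continuous-tone intensity exceeds the threshold at its position in the tiled matrix, and opaque (0) otherwise.

```python
from typing import List


def dither_matrix(n: int) -> List[List[int]]:
    """Build an n x n ordered-dither threshold matrix (n a power of two).

    Assumed Bayer-type construction; entries are a permutation of 0 .. n*n - 1.
    """
    if n < 1 or n & (n - 1):
        raise ValueError("n must be a power of two")
    m = [[0]]
    size = 1
    while size < n:
        top = [[4 * v for v in row] + [4 * v + 2 for v in row] for row in m]
        bottom = [[4 * v + 3 for v in row] + [4 * v + 1 for v in row] for row in m]
        m = top + bottom
        size *= 2
    return m


def ordered_dither(intensity: List[List[float]], n: int = 8) -> List[List[int]]:
    """Translate per-pixel intensities in [0, 1] into bilevel states.

    The threshold matrix is tiled over the field; 1 = clear, 0 = opaque.
    """
    thresholds = dither_matrix(n)
    levels = n * n
    return [[1 if value * levels > thresholds[y % n][x % n] else 0
             for x, value in enumerate(row)]
            for y, row in enumerate(intensity)]


if __name__ == "__main__":
    # One 8 x 8 element at a uniform 25% intensity: 16 of 64 pixels become clear.
    element = [[0.25] * 8 for _ in range(8)]
    for row in ordered_dither(element):
        print("".join("#" if p else "." for p in row))
```

With an 8 x 8 matrix a uniform element can take 65 distinct clear-pixel counts (0 through 64), which is consistent with the 65 gray levels mentioned for Figure 12.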

The resulting bilevel representation of the continuous tone off-axis and/or
on-axis illumination profile is then suitable for recording into a photo-sensitive or
electron beam-sensitive resist material through use of a mask pattern generator. Other
approaches might use techniques common to lithographic or printing
technologies. Such a resist material, when coated over an opaque film on a translucent
substrate, can allow for pattern delineation and creation of the masking aperture.

In the present invention, the existing intensity distribution at the pupil plane of
an illumination system for a projection exposure tool is modified through use of a
bilevel masking aperture containing a masking cell representation of a continuous
tone intensity distribution. Figures 4 and 5 show such distributions where four
distributed-intensity zones allow for off-axis and on-axis illumination of a photomask
that contains geometry oriented in horizontal and vertical directions only. A
maximum circular dimension is defined by a limiting zone, designed to limit the
maximum off-axis angle projected onto the mask. This is used to balance the off-axis
illumination provided to the mask with the degree of coherency of the on-axis
illumination. Smaller geometry requires higher levels of on-axis partial coherence,
leading to larger limiting zones. The extent of the zone will generally correspond to
positions near or beyond the maximum zone or angular position for the off-axis
illumination, generally in, but not limited to, the range from 0.5 to 1.0. The intensity
at any position located beyond this limiting zone is set to zero. The shape of this
limiting zone is not necessarily circular, and selection of the shape will depend on the
extent of feature orientation at the mask. Features constrained to one orientation only
require limitation of off-axis illumination in one direction, resulting in assigning a
value of zero to any element beyond the required x or y value corresponding to the
limiting angle, as shown in Figure 6. This allows for maximum energy at angles within
the desired range and can lead to improved imaging and throughput performance.
Features constrained to two orientations only require that the aperture limit off-axis
illumination angles in two orthogonal directions, leading to a non-circular or square
two dimensional character of the limiting zone, as shown in Figure 19. The invention
allows for this tailoring of the illumination.
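
The effect of a limiting zone can be sketched as follows, under stated assumptions. The Gaussian zone shape, the zone width of 0.30, the 21 x 21 grid and all function names are choices made for this illustration, not parameters fixed by the specification; the zone center of 0.68 and the limit of 0.8 echo values mentioned elsewhere in the text. Coordinates are in sigma units, with 1.0 at the edge of the pupil.

```python
import math
from typing import List


def four_zone_intensity(x: float, y: float, sigma_c: float = 0.68,
                        width: float = 0.30) -> float:
    """Sum of four Gaussian zones on the pupil diagonals, clipped to 1.0.

    sigma_c is the radial distance of each zone center; width is an assumed
    Gaussian standard deviation in sigma units.
    """
    c = sigma_c / math.sqrt(2.0)  # x and y offset of each diagonal zone center
    total = 0.0
    for cx in (-c, c):
        for cy in (-c, c):
            r2 = (x - cx) ** 2 + (y - cy) ** 2
            total += math.exp(-r2 / (2.0 * width ** 2))
    return min(total, 1.0)


def inside_limit(x: float, y: float, limit: float, shape: str) -> bool:
    """True when (x, y) lies within a circular or square limiting zone."""
    if shape == "circular":
        return math.hypot(x, y) <= limit
    if shape == "square":
        return max(abs(x), abs(y)) <= limit
    raise ValueError("shape must be 'circular' or 'square'")


def pupil_map(n: int = 21, limit: float = 0.8,
              shape: str = "circular") -> List[List[float]]:
    """n x n element map; intensity beyond the limiting zone is set to zero."""
    coords = [-1.0 + 2.0 * i / (n - 1) for i in range(n)]
    return [[four_zone_intensity(x, y) if inside_limit(x, y, limit, shape) else 0.0
             for x in coords]
            for y in coords]


if __name__ == "__main__":
    for shape in ("circular", "square"):
        grid = pupil_map(shape=shape)
        open_elements = sum(1 for row in grid for v in row if v > 0.0)
        print(f"{shape}: {open_elements} of {len(grid) ** 2} elements transmit light")
```

With these assumed parameters the overlapping tails of the four zones leave a central intensity of about 0.3, and the square limit retains diagonal (corner) elements that the circular limit of the same size sets to zero. A map of this kind would then be translated to bilevel form by dithering.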

If the existing illumination intensity distribution at the pupil plane of the 
illumination system is not uniform, the non-uniformity at the plane can be
deconvolved in accordance with the invention to result in a masking aperture that also
incorporates compensation for non-uniformity. For example, many steppers provide a
pupil that is guaranteed uniform (+/- 1%) for only 80% of its full opening. At 85%
open, the uniformity of illumination may vary up to +/- 10% or more. With the
invention, the non-uniformity may be canceled or reduced to an acceptable level.
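
The compensation procedure is not spelled out here, so the following Python sketch is only one plausible reading. It assumes a pointwise correction: the desired intensity at each pupil element is divided by the measured relative source intensity there, and the result is rescaled so that no element requires more than full transmission. The function name and the sample values are hypothetical.

```python
def compensate(desired, measured):
    """Aperture transmission values that offset pupil non-uniformity.

    desired:  2-D list of target relative intensities (0..1).
    measured: 2-D list of measured relative source intensities (> 0).
    Pointwise division is an illustrative assumption, not a stated method.
    """
    raw = [[d / m for d, m in zip(drow, mrow)]
           for drow, mrow in zip(desired, measured)]
    peak = max(max(row) for row in raw)
    # An aperture can only attenuate, so rescale if any value exceeds 1.
    scale = 1.0 / peak if peak > 1.0 else 1.0
    return [[v * scale for v in row] for row in raw]

desired = [[1.0, 1.0], [0.5, 0.0]]
measured = [[1.0, 0.9], [1.1, 1.0]]  # hypothetical +/- 10% non-uniformity
transmission = compensate(desired, measured)

# Intensity delivered after the aperture: transmission times source intensity.
delivered = [[t * m for t, m in zip(trow, mrow)]
             for trow, mrow in zip(transmission, measured)]
```

In this example the delivered intensities keep the ratios of the desired distribution, at the cost of an overall 10% reduction in level.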

The overlapping of the continuous intensity regions in the center of the
illumination field produces the on-axis character required for less dense features. The
central intensity is generally greater than 0% and is commonly in the 10 to 50%
range.

Illumination zones within the masking aperture control the illumination to the
mask and are designed to produce optimal off-axis, on-axis, or combined
illumination. This invention allows for an infinite variety and number of such zones.
Some are more desirable than others.

Zones may be circular, elliptical, 45 degree elliptical (that is, elliptical but
oriented with axes at angles of 45 degrees and 135 degrees), square, or other shapes
dependent on the desired distribution of diffraction information to match mask
geometry requirements or specific lens behavior. The distribution of the energy in
these zones or rings may be stepped, Gaussian, Lorentzian, or other similar shape.
The kurtosis of Gaussian distributions may be normal (mesokurtic), narrow
(leptokurtic), or flat-topped (platykurtic), or combinations of these among zones.
Skewness, or departure from symmetry of the distribution, may be utilized for
differential weighting of certain feature sizes. Circular symmetry may be best suited
for most general cases and elliptical distributions can be utilized to accommodate x-y
nonuniformities of the photomask or imparted by the projection lens (a result for
instance of astigmatic or comatic aberration). Figures 7 and 8 show four elliptical
normal distributed-intensity zones placed at diagonal positions corresponding to
off-axis illumination for geometry oriented in horizontal and vertical directions. Figure
18 shows how rotated elliptical Gaussian zones can offer improvements in intensity
distribution for x and y oriented geometry by concentrating some energy in the center
of the pupil and the remaining energy distributed at four-zone angles. Here, rotation
of the zone axis allows for a symmetrical distribution. In this case, energy on the x/y
axes is limited, reducing the non-optimal, on-axis illumination of targeted dense
features. An increase in the efficiency of zero and first diffraction order overlap can
result from such an illumination distribution.

For imaging of geometry in two directions only (x and y only for instance),
there is only a need to spread diffraction order information in the direction of
geometry. By limiting zone intensity distribution to x and y directions, resulting in
continuous intensity or stepped-square shaped zones, maximum off-axis illumination
is maintained up to the maximum angle allowed by the zone dimensions. Beyond
these angles, the degree of off-axis illumination is limited and can be tailored more
specifically for the x and y oriented geometry. Figures 9 and 10 show how
stepped-square zones are implemented in an illumination profile. Figure 11 shows how
this is translated into the bi-level representation.

A square or rectangular shaped obscuration (or an inner limiting zone)
emphasizes the off-axis illumination for feature pitch values whose frequency
distribution falls beyond the chosen value (greater than lambda/(w*NA) where w is
the full width obscuration value between 0 and 2). This is shown in Figure 21 for a
Gaussian off-axis distribution where the obscuration is 30% of the full aperture width.
Combining a square outer limiting zone and a square obscuration, an optimal
condition of off-axis illumination exists also. For features oriented in one direction
only, only two zones are needed on an axis opposite to the feature direction. These
zones can be slots or rectangles since spreading of energy in the direction of feature
orientation is of no consequence to imaging performance and increases throughput.
With two dimensional geometry, four slots are needed in x and y direction, resulting
in a square ring, as shown in Figure 22. This ring can also be considered as the
combination of a square limiting zone and square obscuration. This rectangular ring
source distribution can deliver off-axis illumination for features to 0.25 lambda/NA,
depending on the choice of the limiting outer square zone. This square ring source
distribution can also be combined with other off-axis approaches, such as a Gaussian
four-zone design. Figure 23 shows how a square ring source distribution is added to a
Gaussian four-zone design to produce results that are common to both approaches
(that is, better performance for more dense features out to 0.25 lambda/NA and
adequate through focus and through pitch imaging performance).
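
As a rough arithmetic check on the two expressions in the paragraph above, the following Python sketch evaluates 0.25 lambda/NA and the obscuration expression for the wavelengths and numerical apertures used elsewhere in this description. Reading the obscuration expression as lambda/(w*NA), and taking w = 0.6 for an obscuration that is 30% of a full aperture width of 2, are assumptions made for this illustration. The function names are hypothetical.

```python
def dense_feature_limit(wavelength_nm, na, k=0.25):
    """Feature size k * lambda / NA, in nm."""
    return k * wavelength_nm / na

def obscuration_cutoff(wavelength_nm, na, w):
    """Value lambda / (w * NA), in nm, for a full-width obscuration w."""
    if not 0 < w <= 2:
        raise ValueError("w must be greater than 0 and at most 2")
    return wavelength_nm / (w * na)

print(round(dense_feature_limit(193, 0.60), 1))      # 80.4
print(round(dense_feature_limit(248, 0.63), 1))      # 98.4
print(round(obscuration_cutoff(248, 0.63, 0.6), 1))  # 656.1
```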

Other combinations of source distributions are possible. Figure 24 shows how
an annular ring is combined with a Gaussian four-zone design to emphasize the
performance of more dense features than possible with the annular distribution alone.
This approach can increase the off-axis character of the annulus and reduce the
non-optimal on-axis character.

The masking aperture varies the intensity of the transmitted light at any
element by modulating the state of pixels in each element. The highest intensity
element has all pixels on or at maximum intensity. Light of suitable wavelength
passes through without attenuation. In a preferred embodiment as shown in Figure
12, an element with 64 pixels at a minimum intensity attenuates or blocks all light.
Elements of intensity between none and all are created by the number of pixels
turned on in a given element.

A masking aperture with a bilevel representation of such an illumination
distribution is created by dithering the continuous tone images. Random techniques
or fixed threshold techniques can be used. These fixed threshold techniques are
based on decision rules where any intensity value greater than a threshold value (T)
results in a transparent masking cell and a value less than T results in an opaque
masking cell. The result is generally a high degree of banding in the bilevel
representation. A slight improvement over this method is to replace T with equally
distributed random numbers over the range 0 to S with a new random number
generated for each intensity value. Less banding results but signal to noise is low. A
superior approach to the dithered representation is possible by comparing image
intensity values to position dependent thresholds contained in an n x n dither matrix,
D^n. For a D^n matrix, a matrix element D^n_ij is chosen based on a rule set that causes
the dither matrix to be repeated in a checkerboard fashion over the entire image with
minimum low spatial frequency noise. The proper choice of the dither matrix results
in minimum texture or artifacts and maximum uniformity in intensity. In general, the
optimum dither matrix is represented by the recursion relationship:



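[Equation not reproduced in this text: recursion relationship for D^n]

where:

[Equation not reproduced in this text; the surviving fragment shows a matrix of ones]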



To produce an 8x8 matrix to satisfy these optimization criteria, D^8 becomes:

[Matrix not reproduced in this text]

To utilize this dithering matrix, for example, a continuous tone element intensity
distribution is divided into 64 levels. The lowest intensity level places a pixel in the
masking element at the zero position. An intensity value of 50% places pixels in the
first 31 positions, and so forth. Figure 12 schematically depicts the bilevel
representation of 65 gray levels using the ordered dithering algorithm. Figure 26
shows how this approach is used in a 51 x 51 element array. Other possibilities, such
as 2-level, 4-level, 16-level, and so forth, are solved for in a similar manner.
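
The dithering steps described above can be sketched in Python. Because the recursion relationship and the 8x8 matrix are not legible in this text, the sketch assumes the standard ordered-dither recurrence from the computer graphics literature, with base matrix [[0, 2], [3, 1]], in which each quadrant of the n x n matrix is four times the (n/2) x (n/2) matrix plus the corresponding base entry. That choice, and all function names, are assumptions for illustration and may differ from the matrix actually used. The fixed and random threshold rules are included for comparison.

```python
import random

def dither_matrix(n):
    """Ordered-dither threshold matrix of size n x n (n a power of two).

    Assumes the standard recurrence with base matrix [[0, 2], [3, 1]]:
    each quadrant is 4 * D(n/2) plus the matching base entry.
    """
    if n < 2 or n & (n - 1):
        raise ValueError("n must be a power of two, at least 2")
    base = [[0, 2], [3, 1]]
    if n == 2:
        return base
    h = n // 2
    half = dither_matrix(h)
    return [[4 * half[i % h][j % h] + base[i // h][j // h] for j in range(n)]
            for i in range(n)]

def element_pixels(level, matrix):
    """Bilevel pixels for one element: 1 = transparent, 0 = opaque.

    level is the number of pixels turned on, from 0 to n * n inclusive,
    so an 8x8 matrix yields 65 distinct gray levels.
    """
    return [[1 if threshold < level else 0 for threshold in row]
            for row in matrix]

def fixed_threshold(intensity, t=0.5):
    """Fixed threshold rule: transparent if intensity exceeds T."""
    return 1 if intensity > t else 0

def random_threshold(intensity, rng=random):
    """Random threshold rule: a new uniform threshold for each value."""
    return 1 if intensity > rng.random() else 0

d8 = dither_matrix(8)
half_on = element_pixels(32, d8)  # 32 of 64 pixels transparent
for row in half_on:
    print("".join("#" if p else "." for p in row))
```

Under this assumed matrix, a level of 32 turns on the pixels at positions 0 through 31 and gives an evenly interleaved pattern, which is the low-texture behavior the ordered approach is chosen for.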

The performance improvement for 130 nm features using a 193 nm wavelength
and 0.60 lens numerical aperture is described using a normalized aerial image
log-slope (NILS) metric (normalized to the feature size). This is the slope of the log of
the intensity image (or aerial image). Figures 13 and 14 show NILS plotted against focus
position for circular zone and a bi-level representation of distributed-intensity zone
off-axis illumination where the central intensity is 26%. The ratios are the line to
space size ratio, or duty ratio. The falloff of NILS across all feature duty ratios
matches more closely with increasing amounts of defocus for the distributed-intensity
zone illumination compared to the circular zone condition. The resulting impact on
lithographic imaging is a reduction in the dense to isolated line proximity effect.
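
For readers who want to see how a NILS value is obtained from a sampled aerial image, the following Python sketch takes the slope of the log of the intensity at the nominal feature edge by central difference and multiplies it by the feature size. The function name, the sampling scheme, and the sinusoidal test image are assumptions for illustration; the test image is not simulated lithography data.

```python
import math

def nils(intensity, dx_nm, edge_index, feature_nm):
    """Normalized image log-slope at a sampled edge position.

    intensity:   list of aerial image samples (all > 0).
    dx_nm:       sample spacing in nm.
    edge_index:  index of the sample at the nominal feature edge.
    feature_nm:  feature size used for normalization.
    """
    lo = intensity[edge_index - 1]
    hi = intensity[edge_index + 1]
    log_slope = (math.log(hi) - math.log(lo)) / (2.0 * dx_nm)
    return feature_nm * abs(log_slope)

# Hypothetical image: 130 nm lines at a 1:1 duty ratio (260 nm pitch),
# modeled as a sinusoid with mean 0.5 and amplitude 0.4, sampled every 1 nm.
pitch = 260.0
image = [0.5 + 0.4 * math.cos(2.0 * math.pi * x / pitch) for x in range(261)]
value = nils(image, dx_nm=1.0, edge_index=65, feature_nm=130.0)
print(round(value, 2))  # close to 0.8 * pi for this test image
```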

Figure 15a shows the imaging improvement achieved with the square limiting zone
using the illumination profile of Figure 19. Figure 15b shows the further
improvement using the hybrid design of Figure 23. Figures 15c and 15d show a
comparison of the square ring of Figure 22 with an annular (circular) ring for a 248 nm
wavelength, 0.63 NA, and 150 nm features at 1:1.5 duty ratio to 1:5 duty ratio. Figure
15c shows the performance for an annular or circular ring and Figure 15d shows the
performance for the square ring. Illuminator dimensions are identical; only the shape
differs. No circular or annular ring can match the performance of the square ring.

The amount of total intensity allowed to pass through a masking aperture will
determine its acceptability in situations where exposure throughput is a concern, such
as with the fabrication of integrated circuit devices. Table 2 is a comparison of the
throughput efficiency of several variations of the distributed-intensity four-zone
approach, measured relative to conventional illumination with a σ of 0.7 and a strong
circular zone approach, where off-axis illumination is provided by four circular zones
on a zero transmission field. The worst case throughput is for the circular zone
four-zone design, where the total intensity through the pupil is 27% of that for
conventional illumination with σ=0.7. The distributed-intensity four-zone approach
with a 0.7σ circular hard stop leads to 83% throughput and the same design with a 0.7
half-width square hard stop results in 85% throughput. If the square limiting zone is
increased in size to 0.8 half-width, the throughput increases to 93% and imaging
performance remains comparable to the circular hard stop. This efficiency comes
about because of the amount of energy allowed at the corners of the square pupil,
where the diagonal approaches the full extent of the condenser lens pupil, or a σ value
near 1.0. Comparison of intensity throughput is an important one as illumination
modification is considered. If the illumination system of an exposure tool can allow
full value, σ=1 operation, this square hard stop variation of the distributed-intensity
four-zone can lead to minimal losses.



Table 2

Illumination approach                                                 Relative Intensity

Conventional, 0.7σ                                                    1.00
Circular four-zone, 0.68σc, 0.2σr, 0.8σ stop, 0% field                0.27
Circular four-zone, 0.68σc, 0.2σr, 0.8σ stop, 25% field               0.52
Distributed-intensity four-zone, 0.68σc, 0.3σr, 0.7σ circular stop    0.83
Distributed-intensity four-zone, 0.68σc, 0.3σr, 0.7σ square stop      0.85
Distributed-intensity four-zone, 0.68σc, 0.3σr, 0.8σ square stop      0.93
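
The kind of comparison summarized in Table 2 can be sketched in Python by summing the intensity that passes a hard stop and dividing by the total for a uniform conventional source of σ = 0.7. The sketch assumes four Gaussian zones with a peak transmission of 1, no added central field, and a simple grid sum. Those assumptions, the function name, and the parameter defaults are illustrative only, so the printed values are not expected to reproduce the figures in Table 2.

```python
import math

def relative_throughput(stop, stop_shape, center=0.68, width=0.3,
                        reference_sigma=0.7, n=101):
    """Total intensity through a stopped four-zone pupil, relative to a
    uniform disk of radius reference_sigma sampled on the same grid."""
    axis = [-1.0 + 2.0 * i / (n - 1) for i in range(n)]
    c = center / math.sqrt(2.0)  # zone centers lie on the diagonals
    centers = [(c, c), (-c, c), (-c, -c), (c, -c)]
    shaped = 0.0
    reference = 0.0
    for y in axis:
        for x in axis:
            r2 = x * x + y * y
            if r2 <= reference_sigma ** 2:
                reference += 1.0
            if r2 > 1.0:
                continue  # outside the condenser lens pupil
            if stop_shape == "circle":
                inside = r2 <= stop ** 2
            elif stop_shape == "square":
                inside = abs(x) <= stop and abs(y) <= stop
            else:
                raise ValueError("stop_shape must be 'circle' or 'square'")
            if inside:
                value = sum(
                    math.exp(-((x - cx) ** 2 + (y - cy) ** 2) / width ** 2)
                    for cx, cy in centers
                )
                shaped += min(value, 1.0)  # transmission cannot exceed 1
    return shaped / reference

for stop, shape in [(0.7, "circle"), (0.7, "square"), (0.8, "square")]:
    print(shape, stop, round(relative_throughput(stop, shape), 3))
```

The ordering of the three results follows from geometry alone: a square stop of a given half-width contains the circular stop of the same radius, and a larger square contains a smaller one.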



For on-axis conventional illumination, lens aberrations are evaluated assuming
full use of a lens pupil. With off-axis illumination, diffraction information is
distributed selectively over the lens pupil, influencing the impact of aberrations on
imaging. In general, astigmatic effects can worsen while spherical aberration and
defocus effects can be improved. Coma induced image placement can be further
aggravated with off-axis illumination (OAI) unless rebalanced with tilt. Figure 16
shows how the normalized image log slope is impacted for circular four-zone
(σc=0.7 and σr=0.2 on a zero intensity field) with 0.1 waves of primary aberrations:
astigmatism, tilt, spherical, and coma, through focus for 130 nm features imaged with
a 193 nm imaging system, using a numerical aperture of 0.6. Although the effects of
spherical aberration are minimal, the losses in NILS for astigmatism, tilt, and coma are
significant. Figure 17 shows the same imaging process using the bi-level
representation of the distributed-intensity four-zone illumination described in this
invention. Similar results are obtained using dithered intensity circular, stepped
square, and 45 degree elliptical zones. Here it can be seen that the influences of
astigmatism and tilt are reduced at zero defocus values and at larger defocus values
(both positive and negative) the influences of all aberrations are reduced substantially,
as compared to the case shown in Figure 16.

Implementation of the invention into existing illumination systems is
accomplished via access to the illumination optical system. In one embodiment,
pixelated half-tone illumination files are transferred lithographically onto a transparent
substrate, such as fused silica, coated with a suitably opaque masking layer, such as
chromium. A photoresist film coated over the metal coated transparent substrate is
exposed using optical, electron beam, or other methods by translating the bi-level
illumination representation into a suitable machine-readable format. Photoresist
development and subsequent etching of the underlying masking film allows transfer
of the pattern to the masking aperture. An anti-reflective layer can be coated over the
masking film prior to photoresist coating, exposure, and processing to reduce
reflection, stray light, and flare effects in the illumination field of the exposure tool.
An anti-reflective layer can be coated over the patterned aperture to match
reflectances over the entire illumination field. Alignment of apertures is made
possible by incorporating alignment fiducials on the masking apertures and on the
aperture holders used to mount the invention into the exposure tool illumination
system. Apertures can be inserted into a pupil plane of the illumination system.

Those skilled in the art will understand that a number of the new results
achieved with the above-described aperture mask may also be achieved with a
traditional beamsplitter illumination system. For example, consider an illumination
system such as the one shown and described in U.S. Patent No. 5,627,625. When an
aperture mask 60 with a large, central square opening 64 and an opaque border 62 is
inserted proximate to the fly's eye or in the lens pupil 19, the four beams from the
beam splitter 16 generate contour and three dimensional plots similar to Figures 18
and 19. The beams fill the corners of the illumination pupil and limit the non-optimal
frequency spreading character along the x and y axes while optimizing the off-axis
illumination angles. The new results can also be achieved with an illumination
system using diffractive optical elements (DOEs) to shape the illumination profile.
An example of this illumination shaping method is described, for instance, in U.S.
Patent No. 5,926,257 (Canon) and U.S. Patent No. 5,631,721 (SVGL). Figure 25
shows how an aperture (60) with a large, central square opening (64) and an opaque
border (62) is inserted proximate to the beam shaping optical system (2). This can
allow for shaping with a square limiting zone. Additionally, the beam shaping optical
system using diffractive optical elements can be tailored to produce similar results.
That is especially helpful for imaging features that are oriented along x and y
directions in the mask plane. The use of a central obscuration (square and also round
shaped) can similarly be achieved and can lead to the performance improvements
described here. Furthermore, any combination of off-axis illumination with a square
aperture or obscuration has potential to improve performance for geometry oriented in
the x-y direction. This can include, but is not limited to, round zones, elliptical zones,
square zones, and annular slots (that is, an annular ring masked off on the x and y axes
to form arc-shaped zones).

The masking aperture described above may be used alone or in combination 
with prior art techniques and apparatus. For example, the masking aperture of the 
invention may be combined with the four hole metal mask described in JP patent 
Laid-Open (KOKAI) Publication No. 4-267515 and discussed in the background of
U.S. Patent No. 5,627,625. 

Those skilled in the art will understand that the new results achieved with 
elliptical, 45 degree elliptical, square rings, and square shaped zones may also be 
achieved with alternative approaches, including the beamsplitter approach. This 
could be accomplished for example with the illumination system shown in Figure 20 
(and described in U.S. Patent No. 5,627,625) by shaping divided beams into elliptical, 
45 degree elliptical, and square shapes in the beamsplitter unit 16 and superposing 
them using a prism unit 17. Gaussian or similar shaped energy distribution is 
possible. 

Those skilled in the art will also understand that the new results achieved with
elliptical, 45 degree elliptical, square rings, and square shaped zones may also be
achieved with diffractive optical element approaches to beam shaping, such as those
described in U.S. Patent No. 5,926,257 and U.S. Patent No. 5,631,721, by tailoring
the diffractive optical elements to exhibit these characteristics. The micro-diffractive
optical elements within the beam-shaping optical system (2) in Figure 25 are
manipulated to allow for the required shaping. It is well known that diffractive
optical elements (DOEs) and holographic optical elements (HOEs) can allow for
efficient manipulation of arbitrary wavefronts with more flexibility and reduced
fabrication requirements compared to conventional refractive optics (see for instance
Z. Yang and K. Rosenbruch, SPIE Vol. 1354, (1990), 323). One DOE or HOE or the
combination of two or more DOEs or HOEs allows for the design flexibility needed to
achieve the desired results, as demonstrated for instance in U.S. Patent No. 5,926,257
and in SPIE Vol. 1354, (1990), 323.

The present invention is described above but it is to be understood that it is not 
limited to these descriptive examples. The numerical values, number of zones, 
shapes, and limiting zones may be changed to accommodate specific conditions of 
masking, aberration, feature orientation, duty ratio requirements, lens parameters, 
initial illumination non-uniformities, and the like as required to achieve high 
integrated circuit pattern resolution. Results can also be obtained by controlling 
illumination at any Fourier Transform plane in the illumination system.